Model Selection

Low VRAM optimization

# Low VRAM optimization

Qwen2.5 Omni 7B AWQ

Qwen2.5-Omni is an end-to-end multimodal model capable of perceiving multiple modalities including text, images, audio, and video, while generating text and natural speech responses in a streaming manner.

Multimodal Fusion

Transformers English

LTX Video 0.9.7 Dev

The first real-time high-quality video generation model based on DiT architecture, capable of generating 1216×704 resolution videos at 30fps

Video Processing English

Auraflow DomoKun LoRA Rank8

A standard PEFT LoRA model trained on fal/AuraFlow, specializing in text-to-image and image-to-image generation tasks for Domo-kun characters.

Image Generation

FLUX Hyperscale Fused

FLUX is a text-to-image generation model that integrates 5 high-quality fine-tuned adapters, capable of producing images in various styles

Image Generation English

Wan2.1 Fun 1.3B Control

Wan2.1-Fun-1.3B is a text-to-video generation model that supports multi-resolution training and first/last frame prediction.

Text-to-Video Supports Multiple Languages

Origami WanLora

This is a LoRA adapter based on the Wan2.1-T2V-14B model, designed for generating origami-style videos.

Text-to-Video English

Phi3 Uncensored Chat

A fine-tuned version based on microsoft/phi-3-mini-4k-instruct, specifically designed for role-playing dialogues with various characters

Large Language Model

Transformers English

Wan2.1 Fun 1.3B InP

Wan2.1-Fun-1.3B is a text-to-video generation model developed by Alibaba PAI team, supporting multi-resolution training and first/last frame prediction.

Text-to-Video Supports Multiple Languages

Steamboat Willie 1.3b

A LoRA model trained on Steamboat Willie animation clips for generating text-to-video content in golden age animation style

CogView4-6B is a text-to-image model based on the GLM-4-9B foundation model, supporting both Chinese and English, capable of generating high-quality images.

Text-to-Image Supports Multiple Languages

Deepseek R1 AWQ

AWQ quantized version of DeepSeek R1 model, optimized for float16 overflow issues and supports efficient inference deployment

Large Language Model

Transformers Supports Multiple Languages

cognitivecomputations

Omnigen V1 Bnb 8bit

The 8-bit quantized version of OmniGen-v1, suitable for text-to-image and image-to-image tasks, supporting multimodal input.

Stable Diffusion V3 5 Large GGUF

Stable Diffusion 3.5 Large Model is a multimodal diffusion transformer (MMDiT) text-to-image model, with significant improvements in image quality, text layout, complex prompt understanding, and resource efficiency.

Text-to-Image English

Flux Actors Face Inset Cig Cards LoKr

A LyCORIS adapter based on FLUX.1-dev, specializing in text-to-image generation tasks, particularly suitable for work environments.

Image Generation

Flux Fusion V2 4step Merge Gguf Nf4

A text-to-image model formed by merging Schnell, fine-tuned Dev, and Hyper, recommended steps 4-8, with significant quality improvement at 4 steps

Text-to-Image English

Neuraldaredevil 8B Abliterated GGUF

This is a quantized version of the NeuralDaredevil-8B-abliterated model, providing model files of various quantization types, suitable for users with different hardware conditions and requirements.

Large Language Model

ALMA-7B-Ja-V2 is a machine translation model supporting Japanese-English bidirectional translation, with improved performance through additional training on the previous version.

Machine Translation

Transformers Supports Multiple Languages

Ether Real Mix is a text-to-image generation model based on stable diffusion technology, focusing on generating high-quality, artistic-style images.

Image Generation

RWKV is a large language model combining the advantages of RNN and Transformer, supporting efficient training and fast inference with unlimited context processing capability.

Large Language Model

Cool Japan Diffusion 2 1 2

An anime-style text-to-image model fine-tuned on Stable Diffusion, specializing in 'Cool Japan' cultural content

Image Generation

Genji Python 6B Split

GPT-J 6B fine-tuned model for Python code generation, specialized in Python programming assistance

Large Language Model

Transformers English

Gpt J 6B Vietnamese News

This is a 6B-parameter Vietnamese causal language model based on GPT-J architecture, specifically trained for Vietnamese news content.

Large Language Model

Transformers Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase